Wikipedia visitor statistics, raw hourly logfiles for December 2007
- Publication date
- 2007-12-31
- Usage
- Public Domain
- Topics
- data, analysis, statistics, user behavior, long tail, Wikipedia, web traffic
- Language
- English
- Rights
- Contains no personal data. No copyright applies; this item is in the public domain.
- Item Size
- 12.5G
Logs have also been uploaded for January 2008, February 2008, March 2008, April 2008, May 2008, June 2008, July 2008, August 2008, September 2008, October 2008, November 2008, December 2008,
January 2009, February 2009, March 2009, April 2009, May 2009, June 2009, July 2009, August 2009 and September 2009.
Filename timestamps (yyyymmdd-HHMMSS) use the Gregorian calendar and the UTC timezone (Coordinated Universal Time, equivalent here to Greenwich Mean Time). The minutes are always 00; the seconds are sometimes 00 and sometimes 01. The one exception is 20081231-235959.
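The timestamp convention above can be handled roughly as follows. This is a minimal sketch, not part of the original upload: the function names `parse_log_filename` and `hour_bucket` are hypothetical, and the example filename in the usage note is made up for illustration. Truncating to the hour absorbs the 00/01 seconds difference; how the 20081231-235959 exception should be bucketed is not stated in the description, so the sketch simply truncates it like any other timestamp.

```python
import re
from datetime import datetime, timezone

# Matches both kinds of file described on this page; ".gz" is optional
# because projectcounts files are listed without that suffix.
_NAME = re.compile(r"^(pagecounts|projectcounts)-(\d{8}-\d{6})(?:\.gz)?$")


def parse_log_filename(name):
    """Return (kind, timezone-aware UTC datetime) for a log filename."""
    m = _NAME.match(name)
    if m is None:
        raise ValueError("unrecognised log filename: %r" % name)
    kind, stamp = m.groups()
    ts = datetime.strptime(stamp, "%Y%m%d-%H%M%S").replace(tzinfo=timezone.utc)
    return kind, ts


def hour_bucket(ts):
    """Truncate a timestamp to the hour, so :00 and :01 seconds agree.

    Assumption: truncation is an acceptable way to group files by hour.
    The 20081231-235959 exception ends up in the 23:00 bucket; whether
    that is the intended hour is not documented in the description.
    """
    return ts.replace(minute=0, second=0, microsecond=0)
```

For example, `parse_log_filename("pagecounts-20071210-180001.gz")` yields the kind `"pagecounts"` and a UTC datetime for 2007-12-10 18:00:01, which `hour_bucket` maps to 18:00:00.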
The files named pagecounts-yyyymmdd-HHMMSS.gz (compressed with GNU zip) contain plain ASCII text lines such as "af Sinn_F%C3%A9in 2 28163", meaning that the Afrikaans language (af) version of the Wikipedia article [[Sinn Féin]] had 2 page views during the hour specified in the filename. The article name is in UTF-8, with whitespace replaced by underscores and the result then percent-encoded (hex URL encoding). A valid URL is made up of "http://" + language code (first field) + ".wikipedia.org/wiki/" + encoded article name (second field).
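As an illustration of the line format and URL rule just described, here is a small sketch. The function names are hypothetical, and the assumption that every well-formed line has exactly four space-separated fields is taken from the single example above; real files may contain lines that do not fit, which the reader function below simply skips.

```python
import gzip
from urllib.parse import unquote


def parse_pagecount_line(line):
    """Split one line into (language code, encoded title, views, last field)."""
    fields = line.rstrip("\n").split(" ")
    if len(fields) != 4:
        raise ValueError("expected 4 fields, got %d" % len(fields))
    project, title, views, last = fields
    return project, title, int(views), int(last)


def article_url(project, encoded_title):
    """Build the URL exactly as described: prefix + language + host + title."""
    return "http://" + project + ".wikipedia.org/wiki/" + encoded_title


def readable_title(encoded_title):
    """Undo the percent-encoding (UTF-8) and turn underscores back into spaces."""
    return unquote(encoded_title, encoding="utf-8").replace("_", " ")


def iter_pagecounts(path):
    """Yield parsed lines from a pagecounts .gz file, skipping malformed ones."""
    with gzip.open(path, "rt", encoding="ascii", errors="replace") as fh:
        for line in fh:
            try:
                yield parse_pagecount_line(line)
            except ValueError:
                continue  # assumption: silently dropping bad lines is acceptable
```

Applied to the sample line, `parse_pagecount_line("af Sinn_F%C3%A9in 2 28163")` gives the language code `af`, the encoded title, 2 views, and 28163 as the last field; `article_url` then produces `http://af.wikipedia.org/wiki/Sinn_F%C3%A9in`.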
The files named projectcounts-yyyymmdd-HHMMSS contain plain ASCII text lines such as "af - 2701 31612142" indicating that the Afrikaans language (af) version of Wikipedia had a total of 2701 page views during the hour specified in the timestamp.
The last number on each line is either a repeat of the first number (the page view count) or the number of bytes transferred.
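A projectcounts file can be summed per project with a few lines of code. The sketch below assumes the four-field layout of the example line ("af - 2701 31612142"); the function names are hypothetical, and the last field is returned untouched because, as noted above, it may be either a repeated count or a byte total.

```python
def parse_projectcount_line(line):
    """Return (project code, page views, last field) for one line."""
    fields = line.split()
    if len(fields) != 4:
        raise ValueError("expected 4 fields, got %d" % len(fields))
    project, _placeholder, views, last = fields  # second field is "-" in the example
    return project, int(views), int(last)


def total_views(lines):
    """Sum page views per project code across any number of hourly lines."""
    totals = {}
    for line in lines:
        project, views, _last = parse_projectcount_line(line)
        totals[project] = totals.get(project, 0) + views
    return totals
```

Feeding the lines of several hourly files into `total_views` gives a per-project total for the period those files cover.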
Starting with 20080517-100000, projects other than Wikipedia are also included, resulting in somewhat bigger files. For example, the first field "af.b" indicates the Afrikaans language version of Wikibooks. The following kinds of project names are used (replace "af" with any applicable language code):
af - af.wikipedia.org - Wikipedia, the free encyclopedia
af.b - af.wikibooks.org - Wikibooks, a free library of educational textbooks
af.d - af.wiktionary.org - Wiktionary, the free dictionary
af.n - af.wikinews.org - Wikinews, the free news source
af.q - af.wikiquote.org - Wikiquote, the free quote compendium
af.s - af.wikisource.org - Wikisource, an online library of free content
af.v - af.wikiversity.org - Wikiversity, set learning free
www.w - the front page www.wikipedia.org
commons.m - commons.wikimedia.org - Wikimedia Commons, a database of freely usable media files
incubator.m - incubator.wikimedia.org - Wikimedia Incubator, potential Wikimedia project wikis in new language versions
meta.m - meta.wikimedia.org - Meta, a Wikimedia project coordination wiki
species.m - species.wikimedia.org - Wikispecies, a free directory of life
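The project codes listed above can be turned into hostnames with a small lookup. This is only a sketch built from that list: the name `project_host` is hypothetical, and any suffix not shown in the list is rejected rather than guessed.

```python
# Suffix-to-domain table taken directly from the list of project names.
_SUFFIX_TO_DOMAIN = {
    "b": "wikibooks.org",
    "d": "wiktionary.org",
    "n": "wikinews.org",
    "q": "wikiquote.org",
    "s": "wikisource.org",
    "v": "wikiversity.org",
    "m": "wikimedia.org",
}


def project_host(code):
    """Map a first-field project code such as "af.b" to its hostname."""
    if code == "www.w":
        # The only ".w" code in the list: the www.wikipedia.org front page.
        return "www.wikipedia.org"
    prefix, sep, suffix = code.partition(".")
    if not sep:
        # A bare language code means that language's Wikipedia.
        return prefix + ".wikipedia.org"
    if suffix not in _SUFFIX_TO_DOMAIN:
        raise ValueError("unknown project suffix in %r" % code)
    return prefix + "." + _SUFFIX_TO_DOMAIN[suffix]
```

For instance, `project_host("af.b")` returns `af.wikibooks.org` and `project_host("commons.m")` returns `commons.wikimedia.org`.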
- Addeddate
- 2009-09-19 12:09:49
- Author
- Wikimedia Foundation
- Identifier
- wikipedia_visitor_stats_200712
- Identifier-ark
- ark:/13960/t0ks76j66
- Year
- 2007
Reviews
Subject: Most recent dumps
You can compare the number of files and their sizes with this spreadsheet: https://spreadsheets.google.com/ccc?key=0AkbTq2pGEiCodE0yd3BWb0hrY3RQWTNMamtFOE9NRHc&hl=en#gid=0
Subject: More dumps
If you are looking for more dumps, in English or other languages, or for other data such as visit logs, check this page[1].
Regards,
emijrp
[1] http://en.wikipedia.org/wiki/User:Emijrp/Wikipedia_Archive
IN COLLECTIONS
Wikimedia projects visitor statistics, raw hourly logfiles; Wikimedia Foundation; Wiki Collections; Web Crawls
Uploaded by aronsson on